Accepted to ApJ: 14 October 2010 

Preprint typeset using I^'T^]X style emulatoapj v. 2/16/10 



O 
o 

Q 
in 



O 
U 

6 



THREE-POINT CORRELATION FUNCTIONS OF SDSS GALAXIES: LUMINOSITY AND COLOR 
DEPENDENCE IN REDSHIFT AND PROJECTED SPACE 

Cameron K. McBride^'^, Andrew J. Connolly'*, Jeffrey P. Gardner^, Ryan Scranton''', Jeffrey A. Newman^ 

Roman Scoccimarro'', Idit Zehavi", Donald P. Schneider* 

Accepted to ApJ: 14 October 2010 

ABSTRACT 

The three-point correlation function (3PCF) provides an important view into the clustering of 
galaxies that is not available to its lower order cousin, the two-point correlation function (2PCF). 
Higher order statistics, such as the 3PCF, are necessary to probe the non-Gaussian structure and shape 
information expected in these distributions. We measure the clustering of spectroscopic galaxies in the 
Main Galaxy Sample of the Sloan Digital Sky Survey (SDSS), focusing on the shape or configuration 
dependence of the reduced 3PCF in both redshift and projected space. This work constitutes the 
largest number of galaxies ever used to investigate the reduced 3PCF, using over 220 000 galaxies in 
three volume-limited samples. We find significant configuration dependence of the reduced 3PCF at 
3 — 27 h~^Mpc, in agreement with ACDM predictions and in disagreement with the hierarchical ansatz. 
Below 6 h^^Mpc, the redshift space reduced 3PCF shows a smaller amplitude and weak configuration 
dependence in comparison with projected measurements suggesting that redshift distortions, and not 
galaxy bias, can make the reduced 3PCF appear consistent with the hierarchical ansatz. The reduced 
3PCF shows a weaker dependence on luminosity than the 2PCF, with no significant dependence on 
scales above 9 /i~^Mpc. On scales less than 9 h~^Mpc, the reduced 3PCF appears more affected 
by galaxy color than luminosity. We demonstrate the extreme sensitivity of the 3PCF to systematic 
effects such as sky completeness and binning scheme, along with the difficulty of resolving the errors. 
Some comparable analyses make assumptions that do not consistently account for these effects. 
Subject headings: large-scale structure of universe - galaxies: statistics - cosmology: observations - 
surveys 
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1. INTRODUCTION 

In the current paradigm of structure formation, an al- 
most uniform distribution of mass at early times in the 
universe evolved through gravitational instability into 
the irregular and complex distribution that galaxies oc- 
cupy today. Gravitational dynamics are sensitive to cos- 
mology and depend on the spatial curvature of the uni- 
verse as well as the nature of its contents. A combi- 
nation of many observations inclu ding both constraints 
from l arge-scale structure (LSS; iTegmark et al.' 120041: 
[ Cole et al. 2005; Eiscnstcin et all 12005; Tcgmar ketaLl 
2006; I Sanchez et al . 2009) and t he co smic microwave 
background (CMB; Spergel et aLll2007l ) support a stan- 
dard model in accord ance with current obse rvations (see 
recent constraints in iKomatsu et al.l l2009fl . These ob- 
servations suggest a critically dense (spatially flat) uni- 
verse, consisting of a small amount of baryonic matter. 
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several times more mass in cold dark matter (CDM), 
and well over the majority of the current energy den- 
sity in some form of dark energy (A). This concordance 
model, referred to as ACDM, forms the basis of predict- 
ing LSS and the framework underlying statistical studies 
of galaxy distributions. 

Modern gala xy survey s, such as the Sloan Digital Sky 
Survey (SDSS; I York et ah 2000; Stoughton ct al. 2002), 
provide a wealth of information about large-scale struc- 
ture, galaxy formation, galaxy evolution and cosmology. 
The main workhorse for statis tical analyses has been the 
n-point correlation functions ()Peebleslll980[ ). The most 
common clustering measurements use the two-point cor- 
relation function (2PCF) or its analog in Fourier space, 
the power spectrum. Results from measurements of these 
statistics using observational datasets have been able to 
distinguish subtle differences in theoretical models of how 
galaxies occupy dark i natter halos (e.g. d epartures from 
power law clustering, 'Zeha vi et al.ll200^ . a comprehen- 
sive study of the effects of lumi nosity and color on galaxy 
clustering (Ze havi et al.l 120051 120101 ) , as well as provid- 
ing precise measurements of cosmological phenomena to 
better understand the nature of dark energy (e.g. baryon 
acoustic oscillations, lEisenstein et al.ll2005[ ). 

If the galaxy distribution was entirely Gaussian, the 
2PCF would provide a complete description of galaxy 
clustering. Although analyses of the CMB suggest that 
the primordial mass fluctuations in our uni verse appear 
extremely Gaussian (jKomatsu et al.l I2OO90 . we expect 
gravitational collapse to produce non-Gaussian signa- 
tures in the galaxy distribution that we measure today 
(Bernardeau et al., ,2002.) . As such, the 2PCF provides 



only a partial view of the full distribution and cannot 
sufficiently probe non- Gaussian signals. 

To investigate non-Gaussian structure, as well as shape 
information in these distributions, we require higher or- 
der clustering statistics. In the hierarchy of n-point cor- 
relation functions, the 3PCF is the lowest order s tatis- 
tic to provide information on shape ()Peebleslll98Cl[) . For 
example, this enables probes of the triaxial nature of 
halos and extended filaments within the "cosmic web". 
Measurements of higher order moments allow a more 
complete picture of the galaxy distribution, breaking 
model degeneracies describing cosmology aiid gala xy bias 
(jZheng and Weinberg! [2007t iKulkarni et al.ll2007[ ). 

The statistical strength of information in higher or- 
der moments of clustering can potentially rival that 
of th e two-point statistics (jSefusatti and Scoccimarra 
l2005f l. As a complement to bispectrum (Fourier trans- 
form of the 3PCF) and redshift-space analyses, we can 
use the projected 3PCF to sidestep redshift distortions 
in observational data by integrating out the effect of pe- 
culiar velocities from the density field. 

The additional information contained in higher or- 
der moments comes at a price. Their increased com- 
plexity make the measurements, modeling and interpre- 
tation difficult. Theoretically, non-linear contributions 
have significant non-trivial implications. Their calcula- 
tion gets computationally challenging and efficient algo- 
rithms become critically important. They require larger 
and cleaner galaxy samples as they show more sensitivity 
to observational systematics than the 2PCF. As it was 
recently described: "the overlap between well understood 
theory and reliable me asurements is in fact disquietingly 
smaU" (jSzapudil |2009[ ) . This work attempts to increase 
this overlap by leveraging the massive dataset available 
from the SDSS. 

Previous work has estimated the 3PCF from modern 
galaxy redshift surveys, including w ork on the the 
two-degree field ga, l axy re d shift survey (iJing and Borneii 
[200I iWang et aLl [20P: iGaztanaga et al.l 120051 ) and 
results froin earli e r SDSS data ( Kavq et aLl 120041 : 
iNichol et all l2006t IKulkarni et"all l2007| ). ^Fourier 
space analogs or related higher order st atistics have 
also b een measured for these d atasets (iVerde et al.l 
200I: IPan and Szapudil [20051: IHikage et al. l 120051: 



Nishimichi et al.ll2007D . ' 



This work is the first of two papers analyzing the re- 
duced 3PCF on SDSS galaxy samples, where this work 
focuses on details of the measurements as well as the 
dependence of clustering from varying the galaxy sam- 
ple's luminosity and color. The second paper (McBride, 
et al. 2010) utilizes the configuration dependence to 
constrain non-line ar galaxy- mass bias param eters in the 
local bias model (jFrv and Gaztanagalll993| ). Through- 
out our study, we assume a standard flat ACDM cos- 
mology where V.^ — 0.3, ^\ — 0.7, and Ho = 
/ilOOkms-^Mpc"^ . 

This paper is organized as follows. We discuss the 
SDSS data and the simulations in [J2l We review the 
relevant theory and methods of our analysis in fJS] We 
introduce our measurements in []4j which include mea- 
surements of the 2PCF in §4.11 the equilateral reduced 
3PCF in §4.2) and configuration dependence of the re- 
duced 3PCF in ij4.3l We resolve the covariance of our 



measurements in i i4.4l and investigate the effects of large 
oyerdense structures, such as the Sloan Great Wall, in 
HA.5[ We discuss our results and compare to previous 
studies in f|5l Finally, we review our findings in Sj6l 

2. DATA 
2.1. SDSS Galaxy Samples 

The SDSS (lYork et al.l l2000t IStouehton et~all I200I) 

employs a dedicated 2.5 meter telescope (|Gunn et al.l 
11998, 2006) at Apache Point Observatory in New Mexico. 
Nearly a quarter of the sky was imaged in five bandpasses 
(|Fukugitaet~aII 119961 iSmith et al. I2002D . reduced by a 
processing pipeline (iLupton et al. I2001D, and calibrated 
for accurate astrometrv (iPier et al.ll2003D and photom- 
etry (Hogg et al. 2001; llvezic et all 120041 : iTucker et afl 
[20061; [Padmanabhan et al."2008). 

For our analysis, we use spectros copic galaxy data de - 
fined as the Main galaxy sample ([Strauss et al.l [200^ . 
The algorithm which defines this selection targets about 
90 galaxies per square degree that turn out to have 
a median redshift of 0.104, a high completeness, and 
an accurate statistical separation of stars and galaxies 
whic h prevents stellar co ntaminants in the galaxy sam- 
ples (jStrauss et al.l [200^ . A fiber based spectrograph 
observes the targets using an adaptive tiling algorithm 
([Blanton et al.|[200l . 

The SDSS galaxy samples are made more readily avail- 
able via the New York University Value-Added Galaxy 
Catalog (NYU-VAGC: Blanton et al. 2005a) . This cata- 
log provides detailed characterizations of the sample ge- 
ometry and completeness as well as correcting for known 
systematics that are pertinent to large-scale structure 
analyses. These include ii'-corrections, passive evolution 
corrections, and "fiber collision corrections" that account 
for missing spectra due to galaxy pairs that are closer 
than fibers can be positioned on the sky (55"). We con- 
duct our analy sis of clustering measurements us ing galax- 
ies from DR6 (l Adelman-McCarthv et~aI[|2008D . 

The NYU-VAGC galaxy catalog corresponding to DR6 
contains ~ 470 000 galaxies covering 6377 square de- 
grees of unmasked area (we neglect regions around bright 
stars). We select volume-limited sub-samples from this 
flux-limited parent catalog to analyze well-defined sam- 
ples, which still contain a large number of galaxies 
(221 500 unique galaxies in three samples). We do not an- 
alyze a flux-limited sample. Using volume-limited sam- 
ples prevents systematic effects on clustering measure- 
ments from the inaccuracies in the radial selection func- 
tion, and allows a cleaner association of clustering differ- 
ences to properties of the galaxy sample, such as lumi- 
nosity and color. 

We construct the volume-limited samples by using cor- 
rected absolute r-band magnitude as a function of red- 
shift. Our absolute r-band magnitudes use the NYU- 
VAGC convention: absolu te magnitudes represe nt values 
at z = 0.1 (see details in lBlanton et al.ll2005aD . These 
are often written as Mov, which we simplify to Mr. Ba- 
sically, passive evolution and iiT-corrections use the me- 
dian redshift of the flux-limited SDSS Main sample to 
minimize uncertainties in their empirical determination. 
Formally, Mr — Mr ~ 5 log h but our unit convention 
sets h = 1 making the last term unnecessary. We define 
a sub-sample of objects by selecting bounding redshifts 



Name 



Magnitude 



Specifics of SDSS galaxy samples 

Volume Number of 
?i~^Gpc'^ Galaxies 



Redshift 



Blue / Red 
Galaxies 



Density 
10-=^ h-^Mpc^ 



BRIGHT Mr < -21.5 0.010-0.210 0.1390 37875 — 0.272 
LSTAR -21.5 < Mr < -20.5 0.053-0.138 0.0391 106 823 46 574/60 249 2.732 

FAINT -20.5 < Mr < -19.5 0.034-0.086 0.0098 76808 39360/37 448 7.849 

Table 1 

The redshift limits, volume, total number of galaxies, number per color sample, and completeness corrected number density are shown 
for the three galaxy samples constructed from the SDSS DR6 spectroscopic catalog. We selected these by cuts in redshift, z, and 
corrected (K-correction and passive evolution) absolute r-band magnitude, Mr, to create volume-limited selections. These samples 

represent 221 506 unique galaxies. 



which correspond to a specific luminosity range. We de- 
fine three samples: a BRIGHT sample with Mr < —21.5, 
LSTAR with -21.5 < Mr < -20.5, and FAINT with 
—20.5 < Mr < —19.5. We show the selection of these 
samples in Figm-e[TJ We tabulate their properties, such 
as the redshift range, volume, number of objects, and 
completeness corrected number density in Table [TJ 

Of particular concern might be the "fiber collision cor- 
rections" in the galaxy data. The correction used in the 
NYU-VAGC assigns the redshift of the nearest angular 
neighbor to a galaxy that does not have a redshift, reduc- 
ing the number of collisions from ~ 7% to less than ~ 4%. 
This corre ction was investigated us ing galaxy mocks in 
the 2PCF (jZehavi et al.ll2Q02ll2005[) . and concluded that 
it works incredibly well for separations above the collision 
scale with only a small residual effect for redshift space 
quantities on all scales. We do not expect this correction 
to be appropriate for smaller scales. However, the fiber 
collision correction does not significantly affect our anal- 
ysis as we focus on scales above 3 h~^Mpc. The collision 
scale at the largest redshift in our galaxy sample remains 
well below this value, as 55" corresponds to a separation 
of - 0.13 /i-^Mpc at z = 0.21. 

Galaxy clustering measurements f or SDSS have been 
show n to vary strongly with color (jZehavi et alj 120051 
|2010[) . We investigate the color dependence on two of 
our samples (LSTAR and FAINT) by defining a simple 
red-blue split based on the g — r color. There is a strong 
bi-modality between red and blue sub-samp les that varies 
with absolute magnitude, as shown in iBaldrv et al.l 
(|2004D . We adopt a simple linearly sloped color cut that 
depends on absolute r-band magnitude to account for the 
bi-modal distribution which separa tes the red sequenc e 
from the blue cloud, as described in lZehavi et al.l (J2005D . 
Specifically, the g — r color limit is 



{g - r)H™ = 0.21 - O.OSAf, 



(1) 



Radial distances and absolute magnitudes are calculated 
using a flat ACDM cosmology with (57,„ — 0.3 , J7a = 
0.7). 

2.2. Hubble Volume Simulation 

We compare the SDSS galaxy distribution with the 
structure in cosmological A^-body simula tions. We use 
the Hubble Volume (HV ) simulations (jColberg et al.l 
120001: lEvrard et all I2002D that were completed by the 
the Virgo Consortium. We choose the simulation with 
ACDM cosmology: {fl^ = 0.3, r^A = 0.7, H^ = 
70kms-iMpc"\ as = 0.9). This HV simulation con- 
sists of 1000^ particles in a box of (3000 h-'^Mpc)^ 
volume which evolves particles of mass rripart = 2.2 x 




Figure 1. This figure depicts the selection of three volume- 
limited samples from the SDSS DR6 release detailed in Table [T] 
and shown as shaded boxes corresponding to BRIGHT, LSTAR, 
and FAINT (from top to bottom). Each point represents a galaxy, 
and the blue and black colors depict the two overlapping parent cat- 
alogs (a single galaxy can be in both catalogs). The black points 
correspond to NYU-VAGC bright which does not include a limit 
on a galaxy's bright apparent magnitude. The blue points, over- 
plotted on the black, correspond to NYU-VAGC safe and include 
a bright cut to the apparent magnitude (see Blanton et al. 2005a). 
We select redshift limits where galaxies of all included magnitudes 
can be seen at both the inner and outer boundaries. 



10^^ H^-^Mq from an initial redshift of Zinu = 35 to the 
current epoch using a Plummer softened gravitational 
potential where the softening length is 0.1 h~^Mpc. To 
better compare with observational galaxy samples, we 
utilize their "light-cone" realization. The light-cone out- 
put of the HV ACDM simulation was kindly provided by 
Gus Evrard and Jorg Colberg. 

We treat the particles of the HV simulation like a po- 
tential observation and apply observational constraints 
to their distribution. We filter particles to match the 
same angular footprint of the SDSS geometry and radial 
distance to exactly reproduce the volume of the corre- 
sponding galaxy sample. We apply redshift distortions 
by using the peculiar velocity of the DM particle. 

Finally, we randomly downsample the number of dark 
matter particles to make the computational time of the 
analysis more manageable. To verify this downsampling 
does not introduce a systematic bias, we perform a few 
measurements on several downsampled realizations and 
compare with the full DM distribution. We find mini- 



mal differences that are within expectations for Poisson 
samphng. 

3. THEORY & METHODS 

The n-point correlation functions provide a statistical 
description of LSS and remain a standard tool for quan- 
tifying the structure of the mass field and galaxy distri- 
bution (Peeble Jl98Cl[ ). We summarize the basics relevant 
to this analysis below. 

We can define density as a function of position, p{x), 
that has an average density of p. We define the fractional 
overdensity about the mean at a local point as 



Six) 



P 



1 



(2) 



Casting the density in terms of the overdensity effectively 
removes the first moment of the S field, i.e. {S{x)) — 0, 
where the () denotes an ensemble average. The two-point 
correlation function (2PCF) can be defined in terms of S 
values characterized by the separation of two positions, 
''12 = 1^1 — X2\, by assuming homogeneity and isotropy, 
and we write the 2PCF as 



ari2) = {S{x,)S{x2)) 



(3) 



A Gaussian field refers to any distribution that is fully 
described (statistically) by only its first and second mo- 
ments (e.g. a mean and variance). For the S field, the 
mean is zero and ^(r) successfully describes all clustering 
properties of a Gaussian field. Any other possible dis- 
tribution is then non- Gaussian and can have non-trivial 
higher order moments (i.e. non-zero higher order correla- 
tion functions). Higher order functions can be similarly 
defined in terms of overdensity fluctuations, where the 
three-point correlation function (3PCF) is given by 



C(''l2,''23,''3l) = {S{X1)6{X2)S{X3)) 



(4) 



Instead of a single dependent variable, such as ri2 in 
C(^i2)j we see the 3PCF relies on three separations nec- 
essary to parameterize triplets. Further higher order cor- 
relation functions (greater than n = 3) require even more 
variables and quickl y result in a "combinatorial explo- 
sion" of parameters ()Szapudill2Q09D . 

The hierarchical ansatz posits that the 3PCF can be 
estimated by a cyclic combination of respective 2PCFs: 



Ci'ri2,r23,r3i) w Q [C1263 + 6361 + 61C12] 



(5) 



where we have simplifled notation with ^12 — S,{ri2), and 
Q denotes a scaling constant to adjust the amplitude. 
Initial measurements of 3PCF using angular surveys sug- 
gested that t he hierarchical ansatz held at small scales 
with Q « 1.3 (IPeebleslll98(ih . 

What was originally called the hierarchical amplitude 
(Q) can be rewritten as a function, specifically 



Qiri2,r23,r3i) 



C{ri2,r23,r3i) 



<^12f23 + C23'?31 + C31C12 



(6) 



This definition provides a useful normalization, and 
Q(?"i2, ''23, ''31) is commonly referred to as the normal- 
ized or reduced 3PCF. As long as the the 2PCF remains 
well above zero, i.e. the denominator in ^, the value of 
the function Q roughly equals unity regardless of scale. 
This functional form was later justified by gravitational 



perturbation theory, as the evolution of the 3PCF de- 
pends on quadratic terms in the equat ions of motion en- 
capsu lated in the square of the 2PCF (jBernardeau et al] 
|2002[ ). An additional benefit of utilizing such a "ratio 
statistic": we expect Q to be insensitive to both time 
and cosmology. To leading order, Q only depends on the 
spectral index and trian gle configuration (see Figure 9 in 
iBernardeau et al.ll2002[ ). 

3.1. Redshift Distortions and Projected Correlation 
Functions 

Redshift distortions destroy the isotropy of the galaxy 
distribution. They arise from our inability to disentan- 
gle the peculiar (dynamical) velocity of a galaxy's mo- 
tion from the Hubble flow when determining the radial 
distance via the spectroscopic redshift. We characterize 
the correlation functions by a separation in real space 
(i.e. non-distorted), generally denoted as r. In practice, 
observations yield the distorted distance, and we refer 
to this redshift space separation as s. When the angle 
subtended by s is small (i.e. the plane-parallel approx- 
imation), we can decompose the redshift space distance 
into a line-of-sight (tt) and projected separation (r^) such 

that s = (tt^ + Tp)^/^. This effectively encapsulates the 
redshift distortion in the tt coordin ate. ^_^ 

Redshift distortions (iJacksonI 119721 : 

iSargent and Turneii 119771; [Peebles! 11980 ) produce 
two clearly ob servable e ffects on the galaxy distribution 
(reviewed in Harniltori 1998). At small scales, gravi- 
tational collapse becomes highly non-linear and large 
peculiar velocities produce elongated structures in the 
radial direction affectionately referred t o as finqers-o f- 
god (the dominant effect at small r^, see lJacksonlll972l) . 
Larger scales exhibit a flattening from linear infall, 
observed as less obvious structures perpendicular to the 
line-of-sight ( Kaiser 1987 ). 

We can parameterize the redshift space 2PCF in terms 
of the plane-parallel approximation, ^(s) -^ £,{rp, tt), and 
integrate along tt to produce the projected 2PCF: 



Wp{rp) = 2 



^(rp,7r)d7r 



(7) 



Physically, we project the 3D correlation function onto a 
2D surface of the sky in a moving annulus of fixed width 
according to TTmax- The projected 3PCF and its reduced 
form can be analogously defined as: 

Cproj {rpl2 , rp23 , ''p31 ) — 

C{rpl2,rp23,rp3l,TTl2,TT23)dTTl2d'K23 , (8) 



Qproj {fpl2 , rp23 , ''p31 ) 



Cproj {'''pl2,'''p23,rp3i) 



Wpl2Wp23 + Wp23Wp31 + Wp3lWpl2 



(9) 



We extended our simplified notation such that rpi2 is the 
projected analog to ri2, and Wpi2 = Wp{rpi2). We choose 
TTmax = 20 h~^Mpc for the measurements we present in 
this work (discussed further in Appendix |B| . 

Our notation is as follows. We show measurements of 
the 2PCF as a function of redshift space separation s. 



For projected space measurements, we keep the 2PCF 
characterized by a single dependent variable: rp. The 
3PCF is a function with three dependent variables and 
we continue the notation: s or Vp for redshift and pro- 
jected space separations, respectively. Finally, our use of 
r is more general and can refer to separation in (theoret- 
ical) real, redshift or projected space. 

3.2. Estimating the Correlation Functions 

We estimate the correlation functions using normalized 
pair counts for the 2PCF and triplets for the 3PCF. We 
use a class of unbiased a.nd m inimal variance estimators 
(sec Szap udi and Szalavl 119981) to optimally correct for 
edge effects. Th e estimator we use for th e 2PCF was 
first presented bv lLandv and Szalavl ()1993l ): 



£.LS 



DD - 2DR + RR 
RR 



(10) 



where DD denotes the normalized data-data pairs, RR 
represents the normalized random-random pairs and 
DR corresponds to the normalized cross count of data- 
random pairs. The LS estimato r was extended for 
all n-p oint correlation functions by iSzapudi and Szalavl 
(|1998f l. and we estimate the 3PCF as 



Css 



DDD - 3DDR + 3DRR - RRR 
RRR 



(11) 



As before, DDD represents the normalized count of 
data-data-data triplets, DDR corresponds to data-data- 
random, etc. 

We construct random catalogs that are a factor of 5— 10 
greater in density than the data. We find this ratio suf- 
ficient to keep the shot noise contribution of the random 
catalogs well below that of the data for all triplet counts, 
as well as small enough to be computational manageable. 
We designed the random catalogs to have the exact red- 
shift distribution as the data: for each galaxy, we gener- 
ate 5 — 10 points with random angular coordinates but 
identical redshifts. Finally, we match both the angular 
footprint and volume of the corresponding data samples. 

For our pair and triplet counts, we employ an ex- 
act n-point calculator im plemented in JVtropy , a par- 
allel kd-tree framework (iGardner et al.l l2007f) . The 
application, ntropy-npoin t, utih zes an efficient algo- 
rithm developed by Moore et al.l ([2001) and discussed 
in iGrav et al.l (2004). Their original implementation, 
npt , is publicly available and has been used to investi- 
gate the 3PCF (iNichol eraIll2Q0l iKulkarni et alll2007l: 
iMarin et al.ll2008l) . The independent n-point implemen- 
tation based on Atropy shows enhanced runtime perfor- 
mance, true parallel capability, and the ability to search 
di rectly in projected c oordinates of r-p,7r (see details 
in IGardner et al.l 120071 ). We define the coordinates n 
and Tp for each point pair, where the 3D separation is 
n.2 = a^i — a'2 • We use the unit direction to the midpoint 

of the separation, rm to calculate n = r^ ■ n.2 and find 

22 2 

rp = ri2-T^. 

We stress that the ability to utilize massively paral- 
lel computing platforms proved extremely important, as 
the projected 3PCF required almost two orders of mag- 
nitude more time to compute than the spatial 3PCF. We 
utilized hundreds of processors at a time and required 



^ 300, 000 CPU hours to complete the n-point calcula- 
tions presented here. 

3.3. Triplet Configurations and Binning Scheme 

The full 3PCF is a function of three variables that char- 
acterize both the size and shape of triplets. A natural 
and unique description of a triplet is the length of each 
side of the triangle that connects the points: ri, r2 and 
r^ where connectivity is assumed by ^3 = ri -I- ^2 ■ 

In perturbation theory, it is most common to see trian- 
gles described by two side lengths (ri and r2) and their 
opening angle {9) defined by the cosine rule: 



cosf 



rl-rl 



2rir2 



(12) 



We find this last characterization the most natural and 
intuitive and will use it to describe our 3PCF measure- 
ments. When « or f* TT, we refer to triplets as "col- 
lapsed" or "elongated" respectively, both of which have 
two sides being very close to co-linear. As 9 approaches 
7r/2 the triplet forms a right triangle which we will re- 
fer to as a "perpendicular" configuration. The triangle 
shape, or configuration dependence, describes a function 
of 9 where ^^strong configuration dependence" means a 
significant amplitude difference in the reduced 3PCF, 
(9(^17 ''2 J ^), between co-linear and perpendicular config- 
urations and "weafc configuration dependence" shows lit- 
tle or no change of in Q{ri, r2, 9) wit h 9. This terminol- 
ogy relates to that used by Gaztaiia ga and Scoccimairol 
(2605). 

While it might appear this description is both triv- 
ial and pedantic, we caution that these mappings are 
non-linear. Triangle descriptions such as side-side-side 
and side-side-angle remain completely equivalent for ex- 
act values when the triangle vertices are infinitesimal 
points. However, these transformations will not be exact 
for small volumes, as in the case of estimating correlation 
functions where we accumulate counts within bins of a 
finite width. Care must be taken to prevent significant 
discrepancies in 3PCF measurements if the parameteri- 
zation used for the calculation differs from the one that is 
modeled. We carefully consider this subtlety to correctly 
interpret our measurements. 

We choose (ri, r2, 9) to parametrize our 3PCF mea- 
surements, although we measure the 3PCF using bins 
defined by (ri,r2,r3) and use (fT2|) to convert r^ to 9. 
Even though we treat the reduced 3PCF as a function 
of three variables, Q{ri, r2, 9), we may often denote it as 
Q{9) of even Q for simplicity. 

A good choice of binning is not straightforward for the 
configuration dependence of the 3PCF. Since neighboring 
bins need to be tightly packed to measure Q{9), choosing 
a bin- width based on log r will be too large and insuffi- 
cient to resolve a slow variation. On the other hand, 
a small bin-size will quickly become under-sampled, as 
triplets are characterized by three variables. We choose 
to employ linearly spaced bins in 0, and use the co- 
sine rule in (fT2|) to define the midpoint r^. We choose 
a bin-width as a fraction, /, of the measured scale, r, 
such that Ar ~ f x r and a bin at r is measured be- 



tween (r 



-j^). This scheme was independently 



found to be us e ful in a theoretical study of the 3PCF 
bv , Marin et al.l (|2008l ). We compare measurements be- 
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tween three volume-limited galaxy samples which have 
very different number densities (see Table [1]), and select 
a single fraction / to apply to all samples and scales 
for a fair comparison. We set the value of the frac- 
tional bin-width by requiring a "reasonable sampling" 
of the sparsest dataset: the BRIGHT galaxy sample. 
Using a fractional bin-width of 25% (/ — 0.25) results 
in several hundred triplet counts at the smallest scales 
(ri = 3 h^^Mpc), and we use this / value for all mea- 
surements. This produces relatively wide bins with some 
that physically overlap (the same triplet is counted more 
than once). We note two observable consequences: (1) 
a slight damping of the 3PCF near 9 fa n, and (2) an 
induced correlation between some neighboring 9 bins. 
Qualitatively, this should not pose a problem since all 
comparisons use the exact same scheme (including A^- 
body results). Quantitatively, we account for this by 
using the full covariance matrix. We furt her d iscuss the 
implications of binning in the appendix, § IA.3I 

3.4. Estimating the Covariance Matrix 

We measure the correlation between measurements by 
empirically calculating the covariance matrix. Given a 
number of realizations, N, a residual on Q can be defined 
as 

Af = ^^-^ , (13) 



for each realization (k) and bin (z) given a mean value 
(Qi) and variance (af) for each bin over all realiza- 
tions. We use Q as a general placeholder for any mea- 
sured statistic (2PCF, 3PCF, etc). A covariance matrix 
can be estimated from the data itself using a leave one 
out cross validation method, more commonly referred 
to as iackknife re-sampling (see further description in 
iLuptonet al.ll2001[) . 

The number of jackknife samples (which we denote 
with N) are not independent realizations, and the jack- 
knife covariance matrix can be estimated by: 
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J^AfAj, (14) 



k=l 



where Cij denotes the typical unbiased estimator of the 
covariance when computed on jackknife samples. Jack- 
knife re-sampling has been shown to be reliable on scales 
up to 30 /i~^Mpc for the 2PCF on spectroscopic galaxy 
samp les when compared with indep endent mock cata- 
logs ([Zehavi et al l [200l [200l [20Q5r). H owever, a more 
elaborate study bv INorberg et al.l (|2009D highlights po- 
tential problems and caveats of using such an "internal" 
estimate of the errors. We defer questioning the valid- 
ity of jackknife re-sampling methods for estimating the 
covariance of the reduced 3PCF to a companion paper 
(McBride et al. 2010). They compare jackknife estimates 
to those from mock galaxy catalogs generated from A^- 
body simulations. They conclude that while jackknife 
error estimates do not exactly agree with mocks, they ap- 
pear sufficient for the analysis presented here (McBride 
et al 2010). 

We generate our jackknife samples using the pixeliza- 
tion scheme of STOMP 0. This code can account for the 



irregular geometry of the SDSS data, an d has been used 
to qu antify errors in angular clustering (jScranton et al.l 
l2005f ). The jackknife regions are selected to maintain 
equal unmasked area on the sky. In the SDSS samples 
we use, survey depth does not vary over the sky, which 
makes equal area consistent with equal volume. We want 
to resolve the covariance of Q{9) for 15 bins in 9 between 
and TT, therefore using less than 15 jackknife samples 
should formally result in a singular covariance matrix. 
We choose 30 jackknife samples where the regions are 
each ~ 180 square degrees in unmasked area (see dis- 
cussion in McBride et al. 2010 where they investigate 
varying the number of jackknife samples for these Q{9) 
measurements) . 

4. SDSS MEASUREMENTS 
4.1. Two-Point Correlation Function 

We first investigate the 2PCF of our samples. While we 
later focus our clustering analysis on the reduced 3PCF, 
Q{ri,r2,9), it is instructive to directly examine the 2PCF 
since Q is the ratio of the connected 3PCF, ({ri,r2,9) 
divided by products of the 2PCF, £,{r), such that Q oc 
(■/■C^ as shown in © . We choose equal width bins in log r 
to measure the 2PCF. 

The galaxy c orrelation function was firs t shown to be 
a power law by iTotsuii and Kiharal ()1969[ ) using galaxy 
data from the Lick Observatory (Shane and Wirtanen, 
1967). Along with our measurements, we plot a fiducial 
power law model. 
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^ http://code.googlo.eom/p/astro-stomp/ 



which we adopt for comparison fr om a detaile d stud y 
of clustering in SDSS galaxies bv iZehavi et al.l ([2005). 
Given the precision of modern measurements, recent 
work has noted signi ficant departures of the galaxy 2PCF 
from a power law (jZehavi et al.l 120041 ) where the data 
are better described b y the halo model (reviewed in 
iCoorav and Shet"hll2002[ ) . Nevertheless, a power law pro- 
vides a simple and convenient comparison. 

We plot the 2PCF in both redshift space, ^(s), and 
projected space, Wpirp), for our three galaxy samples 
in Figure [2J At larger scales, ^(s) runs almost parallel 
to the fiducial model for all galaxy samples. We note 
the reduction in strength of ^(s) at separations below k, 
5 ft.~^Mpc in Figure^ At these scales this is an effect of 
redshift distortions where the fingers-of-god make galaxy 
pairs at close separations less likely and reduce the small 
scale correlation function. 

In contrast to ^{s), the Wp{rp) does not exhibit the 
same reduction in power, and remains close to a power 
law for small projected separations. This confirms our 
expectations that the projected Wp{rp) should be less af- 
fected by redshift distortions ~ especially at the smaller 
projected scales (see Figure[2l)- At large rp, Wp{rp) shows 
a reduction of power due to truncating the tt integration 
at TTinax = 20 /i~^Mpc. Wc show this effect by using 
our fiducial model. Assuming a perfect power law in real 
space, we depict the resulting Wp{rp) with two lines in 
Figure [21 where the dotted line uses the same TTmax as 
the data and solid line depicts the full projection (i.e. 

TTmax = Oo). 
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Figure 2. The redshift space 2PCF, ^(s) in the left panel, and projected 2PCF, Wp{rp) in the right panel, for our three SDSS samples. 
Brighter galaxy samples exhibit more clustering (e.g. higher amplitude) than fainter samples, in agreement with published analyses of 
SDSS galaxies (Zchavi et al. 2005). Error bars denote Icr uncertainties calculated from 30 jackknife samples. 



In both ^(s) and Wp{rp) we note that brighter galax- 
ies exhibit stronger clustering when compared to fainter 
galaxies. We can understan d this in terms of a simpl e 
galaxy- mass bias parameter (jFrv and Gaztanagal 119931 ) . 
where ^ oc 6^ and the linear bias, 6, increases with galaxy 
luminosi ty; this has been studie d in detail for SDSS 
galaxies (fZehavi et al.ll2005l l201Cl[ ). Our measurements 
are in agreement with their analysis. The galaxy 2PCF 
depicts a higher amplitude than the fiducial model, which 
is consistent with the luminosity of our galaxy samples. 

4.2. Equilateral Three-Point Correlation Function 

The simplest analog to the 2PCF is the equilateral re- 
duced 3PCF, Qeq{r)- As each side of the triangle formed 
by a triplet corresponds to the same scale (ri — r2 = 
r^ = r), Qeq can easily be characterized by a single sep- 
aration: 

C(r, r, r) 



Qeq{r) 



3 CM 



(16) 



Qeq is not sensitive to shape, but it provides a clear sense 
of scale dependence in the reduced 3PCF. When Q^q w 1, 
the number of triplets exactly correspond to those ex- 
pected from the 2PCF; when Qeq is above or below 1, 
there are more or less triplets, respectively. Like the 
2PCF, we choose bins of equal width in logr. 

We show Qeq in both redshift and projected space 
for our three galaxy samples in Figure [3l In redshift 
space, Qeq appears flat and never exceeds 1, showing 
very little difference in terms of luminosity. In the pro- 
jected measurement, small scales reveal Qeq > 1 until 
Tp « 2 — 3 /i~^Mpc, where it approximately reproduces 
the amplitude of the redshift space measurement. We 
expect larger scales to dip below one, as ({r) drops more 
rapidly than pairs in the 2PCF for the same scale. In- 
tuitively, the number of equilateral triplets of a set side 
length become more rare than the number of available 
pairs of the same separation length. 

On small scales redshift distortions destroy small scale 
triplets in a manner similar to what we discussed for 



pairs in the 2PCF. We notice that redshift space Qeq 
remains fiat in Figure |3l but the projected Qeq recov- 
ers power at small rp. This suggests the fiattening of 
the redshift space measurement is a result of redshift 
distortions. This interpretation is supported by a the- 
or etical compa rison of the real and redshift space Qeq 
in iMarin et al.l (|2008[ ) (see their Figure 1). We notice a 
slight variation of the measurement with sample luminos- 
ity, but not nearly as pronounced as seen in the 2PCF, 
and only significant for a few points. At the largest scales 
(r > 10 h~^Mpc), the reduction in amplitude of Qeq and 
increased size of the errors suggest that we are limited 
in signal by the finite volume of the galaxy sample. The 
effects appear worst for the faintest volume-limited sam- 
ple, which is expected since it also occupies the smallest 
volume in comparison with the larger volume brighter 
samples (see Table [T]). Finite volume effects have been 
shown to cause a rapid reduction i n amplitude for hig her 
order measurements of clustering (|Scoccimarrcill2000l) . 

4.3. Configuration Dependence of reduced 3PCF 

We focus our investigation on the shape or config- 
uration dependence of the reduced 3PCF, Q{ri,r2,0). 
The reduced 3PC F w a,s analyzed in reds hift space by 
iKavo et al.l ()2004[ ) and iNichol et all (|2006D using a pre- 
vious release of SDSS galaxy data: "samplel2" - a ver- 
sion between the first and second data releases covering 
^ 2406 square degrees. Unlike these previous studies, 
we restrict our analysis only to volume-limited galaxy 
samples, and jointly investigate redshift and projected 
space measurements. We measure the reduced 3PCF for 
SDSS DR6 galaxies at three scales, which we specify by 
the smallest side of the triangle, ri. We choose r2 so 
that the ratio of the first two sides stays fixed such that 
r2/ri = 2. We then measure the reduced 3PCF as a 
function of opening angle between these two sides, 0, 
regularly spaced between and tt. The resulting scale 
we probe varies from ri, when 9 = (i.e. "collapsed") 
to (ri -\-r2), when 9 = tt (i.e. "elongated"). Specifically 
we measure triplets on scales between 3 — 9, 6 — 18 and 
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Figure 3. The equilateral reduced 3PCF, Qeq{r), for our three SDSS galaxy samples, both in redshift space (left) and projected space 
(right). The equilateral reduced 3PCF is characterized by a single scale for simplicity: Qeqi''') = Q('', '"i r)- We choose a bin-size to be equal 
width in logr analogous to the binning we used for the 2PCF. Error bars denote la uncertainties calculated from 30 jackknife samples. 



9 — 27 h ^Mpc corresponding respectively to ri =3,6 
and 9 h~^Mpc. 

We introduce the configuration dependence for each 
sample and scale in Figure |4l where we overlay the re- 
duced 3PCF in both redshift and projected space. On 
smaller scales (left side of Figure HJ , we see weaker con- 
figuration dependence in the reduced 3PCF (i.e. smaller 
change in Q{0) with 6) than at the larger scales (right 
side). 

We notice a "V-shape" at ri = 9 /i~^Mpc, present 
in both redshift and projected space measurements of 
Figure ID We interpret this as a statistical signal of 
filamentary structure that becomes more pronounced 
as the scale increases: an over-abundance [Q > 1) of 
CO- linear triangles with an under-abundance (Q < 1) 
of perpendicular configurations. This characteristic V- 
sha pe is predicted from gravitational perturbation the- 
ory (iBernardeau et al. 2002) as well as results from sim- 
ulatio ns (jGaztanaga and Scoccimarroll2005l : iMarin et al.l 
[200l . 

We identify the effect of redshift distortions on the 
small scales in the redshift space reduced 3PCF in two 
ways: (1) almost no configuration dependence in Qz{0) 
and (2) Qz{d) < 1 for all configurations, showing a de- 
ficiency of triplets due to non-linear velocities stretch- 
ing compact structures beyond the scales probed (i.e. 
fingers-of-god). On small scales, we notice that Qproj 
has both a higher amplitude and exhibits a stronger con- 
figuration dependence in comparison with Qz- When 
comparing with the 2PCF (Figure[2]), the projected mea- 
surements are larger in amplitude. However, Qproj is 
normalized by Wp{rp), so we expect Qproj to match Qz 
and remain close to 1 in value. The increased amplitude 
of Qproj is attributed to recovering signal that was lost to 
Qz due to redshift distortions. We consider the stronger 
configuration dependence of Qproj refiective of the true 
galaxy distribution, as a simple projection should not in- 
crease the configuration dependence. We conclude that 
the projected measurement recovers signal that is de- 
stroyed in the redshift space measurement due to non- 



linear distortions. However, we expect that our choice 
of Ttmax — 20 h^^Mjpc wiU permit redshift distortions to 
affect these projected measurements, albeit in a reduced 
capacity with respect to redshift measurements. As the 
scale increases beyond 9 h~^Mpc, Qproj and Qz gener- 
ally converge (right column of Figure U) . 

We see a dramatic difference in the size of uncertain- 
ties between these measurements. For each individual 
sample and statistic, there are two important aspects to 
consider. First, the small scales become sensitive to sam- 
pling where a galaxy sample with a lower number density 
will be less well determined (shot noise limited) . Second, 
the larger scale measurements will show more sensitivity 
to volume effects (being dominated by sample or cosmic 
variance). We see this same behavior for Qeq in Figure [31 

The uncertainties increase with larger scales, especially 
for the fainter galaxy samples. We actually expect the 
growth of errors from the "ratio" static of the reduced 
3PCF since we predict the reduced 3PCF to be approx- 
imately unity at all scales. We interpret the general in- 
crease of errors on Q as the higher order 3PCF (numera- 
tor in Q) becoming uncertain at a much faster rate than 
the 2PCF (denominator). 

We notice the uncertainties on the FAINT sample ap- 
pear much larger than measurements on the two brighter 
samples, especially at larger scales. This is not Poisson 
uncertainty as the sample contains over 76,000 galaxies, 
a factor of two greater than the BRIGHT sample. The 
FAINT galaxy sample occupies a much smaller volume, 
and a few large structures dominate the errors in the 
large scale measurements (see discussion in i j4.5|) . The 
observed increase of uncertainties with fainter galaxy 
samples is generally a consequence of the smaller vol- 
ume being probed by the fainter volume-limited samples 
(see Table ip. 

The projected Q{0) shows larger uncertainties than 
redshift space measurements at all scales. This is a natu- 
ral side effect of the projection: a given scale in r^ repre- 
sents a lower bound on the redshift space scale, s, where 
the upper bound is s = (7r^ax + ''p)^''^- Since Q{9) at any 



DR6: T^-.T^ = 3:6 Mpc/h 



DR6: Fjrrg = 6:12 Mpc/h 



DR6: TyT^ = 9:18 Mpc/h 




e / n 
DR6: rj:rg = 3:6 Mpc/h 




fl / TT 
DR6: rj:r2 = 3:6 Mpc/h 




/" 



red shift - ■ 
projected 



I it 






0.5 

e / 7T 

DR6: r,:r2 = 6:12 Mpc/h 




e / -n 
DR6: rj:r2 = 9:18 Mpc/h 



redshift 
projected 




0.5 

e / 7T 
DR6: r,:r2 = 6:12 Mpc/h 




e / TT 
DR6: rj:r2 = 9:18 Mpc/h 



redshift - ■ 
projected --•■ 



I'H 



0.5 

9 / 7T 




/'V 



Figure 4. The reduced 3PCF on SDSS DR6 galaxies siiowing botli redsiiift space and projected measurements. Tiie filled symbols are 
redshift measurements, and the hollow symbols are projected. Triangles correspond to Mr < —21.5, circles with —21.5 < Mr < —20.5 and 
squares with —20.5 < Mr < —19.5. The three columns are different scales, specified by the first side of the triangle (ri) representing the 
smallest scale measured. Error bars denote la uncertainties calculated from 30 jackknife samples. 



given value of Vp is sensitive to scales greater than the 
same value of s, it will encode properties of the larger 
scales. As the uncertainty grows with scale, Qproj will 
reflect higher uncertainties than Q^ when rp ^ s. 

4.3.1. Luminosity Dependence 

To highlight the effect of luminosity on the reduced 
3PCF, we present the measurements of all three samples 
in Figure [5j As we noted in the 2PCF, the luminosity 
of the sample affects the clustering strength. However, 



we see the opposite behavior in Q{0) as brighter galax- 
ies show lower values of Q than fainter galaxies. Also 
remember that in the 2PCF (Figure [2]) brighter galaxies 
showed a stronger segregation with galaxy luminosity. 
If brighter galaxies exhibit stronger clustering, they will 
have a correspondingly higher linear bias b. Since ^ oc 6^ , 
the amplitude will be more sensitive to changes in b as 
opposed to the reduced 3PCF, where Q oc 1/6. The dif- 
ference of Q{9) with sample luminosity appears constant, 
or slightly increases, with scale. 
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Physically, we expect bright galaxies to predominantly 
live in galaxy groups or clusters and centered at the 
"knots" in LSS. Fainter galaxies will be more populous in 
the field and overdense filaments, which might show more 
configuration dependence in the reduced 3PCF. The un- 
certainties on these measurements prevent a clear demon- 
stration of this effect - though we might speculate that a 
few measurements at ri = 9 h~^Mpc hint at fainter sam- 
ples showing a stronger configuration dependence (e.g. 
LSTAR vs BRIGHT in lower right panel of Figure El 

4.3.2. Color Dependence 

We investigate the color dependence of the reduced 
3PCF using two volume-limited samples, designed to 
have unit magnitude bins with limiting r-band magni- 
tudes below and above L* . We divided the galaxies into 
"red" and "blue" sub-samples to probe the color depen- 
dence as described in SJH 

We examine the reduced 3PCF of the FAINT sample 
(-20.5 < Mr < —19.5) in Figure[6] We see configuration 
dependence at all scales, and the V-shape becomes more 
prevalent on larger scales; behavior that is reflected in 
both color sub-samples. The red and blue galaxies show 
a larger disparity at smaller scales, with a significant in- 
crease in the clustering difference at ri = 3 /i~^Mpc for 
both the redshift space and projected Q{0). Unlike the 
luminosity dependence, we notice a significant change in 



the configuration dependence between the color samples. 
This effect is most noticeable in the ri = 3 h~^Mpc con- 
figuration for the projected Q{9), which shows the red 
galaxies with a stronger configuration dependence than 
either the blue or full samples. At configurations where 
ri > 6 h~^Mpc, the blue sub-sample shows greater con- 
figuration dependence than the red, which can be best 
seen in redshift space. Overall, red galaxies typically 
show larger values of Q{0) than the blue and full sam- 
ples. 

We show the reduced 3PCF for our LSTAR sample 
(-21.5 < Mr < -20.5) in Figure [H We stih see con- 
figuration dependence at all scales, including color sub- 
samples — but the color dependence appears diminished 
with respect to the FAINT sample in Figure [6l Again, 
the red galaxies at ri = 3 h~^Mpc show a stronger con- 
figuration dependence than the blue according to Qproj- 
This sample has a volume almost four times larger than 
the FAINT sample, and the large scale ri = 9 /i~^Mpc 
measurements show significantly smaller uncertainties. 
As we speculated before, the blue galaxies appear to have 
a slightly stronger configuration dependence. 

Although the difference between red and blue sub- 
samples is both small and within the reported uncer- 
tainties, it might be significant as the errors are domi- 
nated by cosmic variance. If the blue sub-sample does 
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Figure 6. We show the configuration dependence of the reduced 3PCF for our FAINT galaxy sample with —20.5 < A/,- < —19.5. The 
sample is divided into "red" (red, dashed line) and "blue" (blue, dotted line) sub-samples based on SDSS g — r color (see sample description 
for details). The top row consists of redshift space measurements, and the bottom row depicts projected measurements. The three columns 
correspond to different scales specified by the first side of the triangle (ri) representing the smallest scale measured. The black line denotes 
measurements on dark matter in the Hubble Volume simulation, but matching the selection of the galaxies and includes redshift distortions. 
Error bars denote la uncertainties calculated from 30 jackknifc samples. 



have a stronger configuration dependence than the red 
sub-sample, it would support the typically understood 
notion that blue galaxies more commonly populate the 
filamentary structures at these scales. 

On smaller scales (ri = 3 h~^Mpc), the blue sub- 
sample in projected space shows less configuration de- 
pendence than the red in both samples. We attribute this 
to a real clustering diff'erence that is obscured by redshift 
distortions in the redshift space measurements. However, 
both Qz and Qproj show differences in the amplitude be- 
tween "red" and "blue" populations at the smaller trian- 
gle scales, but this difference declines at the larger scales. 

4.3.3. Comparing Galaxy and Mass Clustering 

In Figures [S] and [3 we also plot measurements of the 
reduced 3PCF based on the dark matter particles in the 
Hubble Volume (HV) simulation. Using iV-body sim- 
ulations enables reliable predictions well into the non- 
linear regime where accurate analytic models do not ex- 
ist (although som e recent work has been introduced, see 
ISmith et al.l 120081) . In addition, we can include obser- 
vational systematics by trimming HV particles to match 
the exact selection and volume of the observed galaxy 
sample, and include the effects of redshift distortions. 
The HV measurement serves as a comparison between 



clustering of the observed galaxies and that expected 
from gravitational evolution of a ACDM mass field. Ex- 
amining the figures, we make two observations: (1) the 
general shape is similar to that of the galaxy reduced 
3PCF, and (2) there is a significant offset in amplitude 
between galaxies and mass. 

We show our measurement of the reduced 3PCF Q on 
our BRIGHT galaxy sample {Mr < -21.5) in Figure H 
As shown previously, the black line depicts measurements 
from the HV simulation in comparison to the symbols 
that denote the galaxy measurements. The BRIGHT 
galaxy sample covers the largest volume, but is a factor 
of 10 less dense than the LSTAR sample and almost 30 
times less dense than FAINT. 

It remains clear that galaxies do not cluster exactly 
the same as the simulated mass field. However, there are 
significant commonalities. The overall shape is the same, 
and the predominant effect is an offset. Remember, the 
nature of the reduced 3PCF makes it insensitive to cos- 
mology (excepting the slope of the power spectrum) , so 
the differences we observe in Q are unlikely to be due to 
variation in assumed cosmology by the HV simulation. 
There is a large volume of work showing that galaxies are 
known to be biased tracers of the mas s field, which offers 
a natural explanation of the offset (jCoorav and Shetbl 



12 



DR6: FjiFg = 3:6 Mpc/h 



DR6: FjXg = 6:12 Mpc/h 



DR6; TjiFg = 9:18 Mpc/h 




Mass 

All Galaxies 
Red Subset 
Blue Subset 







0.5 

9 / 77 




DR6: TjXj = 3:6 


Mpc/h 


-21.5 < M, < 


-20.5 


i\ \ y 

1 <^ N__x^ 

■ ■ ■ \ T T r 


1 


1 .... 1 . 


1 



DR6: Pjirj = 6:12 Mpc/h 



DE6: rj:rj = 9:18 Mpc/h 



Mass 

All Galaxies 

Red Subset — — — 

Blue Subset 




0.5 

fl / TT 



0.5 
B / -n 




Figure 7. We show the configuration dependence of the reduced 3PCF, as in Figure|6l but for the brighter LSTAR galaxy sample defined 
by —21.5 < Mr < —20.5. The sample is divided into "red" (red, dashed line) and "blue" (blue, dotted line) sub-samples based on SDSS 
g — r color (see sample description for details). The top row consists of redshift space measurements, and the bottom row depicts projected 
measurements. The three columns correspond to different scales specified by the first side of the triangle (ri) representing the smallest 
scale measured. The black line denotes measurements on dark matter particles from the Hubble Volume simulation, but matching the 
selection of the galaxies and includes redshift distortions. Error bars denote la uncertainties calculated from 30 jackknife samples. 



120021 ). The l uminos ity dependence of the reduced 3PCF 
discussed in ii4.3.1l is further evidence of the same. The 
difference between clustering of galaxies and the underly- 
ing mass distribution is commonly referred to as galaxy- 
mass bias and depends on galaxy properties such as lu- 
minosity. We constrain the non-linear galaxy-mass bias 
parameters relating to these measurements in a compan- 
ion paper (McBride et al. 2010). 

4.4. Covariance of Galaxy Samples 

Correctly accounting for the covariance in data mea- 
surements is essential to accurately constrain theoretical 
models. If significant covariance exists, neglecting it can 
result in a statistical bias (inaccurate constraints) or an 
overestimate of significance. However, proper estimation 
and use of the covariance matrix remains tricky. For 
statistical measures of clustering in LSS, the covariance 
matrix encodes higher-order information. The covari- 
ance matrix of the 2PCF includes significant third and 
fourth order terms, whereas the covariance of the 3PCF 
has leading order contributions from up to sixth order. 
The covariance matrix itself is a complementary measure 
of clustering. 

We typically use the inverse of the covariance matrix to 
constrain models, such as a common x^ determination. 



Doing this makes an analysis extremely sensitive to the 
noise properties of the covariance matrix (the poorest 
determined eigenmodes can have a dramatic impact on 
the x^ value). While this fact is often overlooked in LSS 
analyses, ways of accounting for poorly resolved modes 
are w ell established for the 2PCF (see e.g.|N orber g et al. 
2008 ) and 3PCF (see e.g. iGaztahaga and Scoccimarrc 



20051 ). If a measured statistic is poorly resolved, i.e. the 



dominant contribution to the error is shot-noise, the off- 
diagonal elements of the covariance matrix can become 
excessively noisy. In such a case, using the full covariance 
matrix might be a poorer choice than a diagonal ap prox- 
imation (demonstrated in lChen and Szapudil 120061 ). 

For our galaxy samples, we resolve the covariance ma- 
trix using jackknife re-sampling methods defined by 30 
equal-area regions on the sky (a factor of two over the 
number of measured bins). 

We show the normalized covariance matrix of the 
2PCF for our three galaxy samples in Figure [SI At these 
scales we expect significant correlation in the covariance 
matrix for all the 2PCF measurements. In the power 
spectrum we expect the Fourier mo des to be statistically 
independent (jBardeen et al.l[l986[ ) and the off-diagonal 
elements of the covariance matrix in the 2PCF preserve 
this Fourier space property. We note the BRIGHT sam- 
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Figure 8. We show the configuration dependence of the reduced 3PCF of SDSS DR6 galaxies with Mr < —21.5. The top row consists 
of redshift space measurements, and the bottom row depicts projected measurements. The three columns correspond to different scales 
specified by the first side of the triangle (ri) which represents the smallest scale measured. The black line denotes measurements on dark 
matter in the Hubble Volume simulation, matching the selection of the galaxies and includes redshift distortions. Error bars denote la- 
uncertainties calculated from 30 jackknife samples. 



pie (right column) appears almost diagonal. Since we 
still expect correlated bins for the sample, we interpret 
this result as an undersampled covariance matrix. This 
can occur when the most significant contribution is shot- 
noise (since Poisson errors are not correlated). While 
this is still a "true" measure of the errors, care must 
be taken when using the full covariance matrix as the 
off-diagonal elements can be noisy. For the two fainter 
samples, we see the projected Wp(rp) measurements ap- 
pear much more correlated than the redshift space ^(s). 
Since Wp projects along the line-of-sight, the projection 
mixes scales between (s = r^) and (s = (tt^ -f r^)^/^), 
inducing a correlation in the measurements. Seeing this 
effect supports the validity our of covariance estimation. 
Finally, we note that the correlation appears to increase 
for the fainter galaxy sample. In the 2PCF, this increase 
of correlatio n with fainter galax y samples has been pre- 
viously seen (jZehavi et al.ll2005l) . 

The covariance matrices for the reduced 3PCF on 
large scales (ri — 9 /i~^Mpc) for the three galaxy sam- 
ples are shown in Figure [TUl Note the significant off- 
diagonal structure for all samples, which is compara- 
ble to that seen in theoretical st udies (see figure 9 in 
iGaztanaga and Scoccimarrol 120051 ). The correlation be- 
tween a few neighboring bins at « tt is enhanced due to 



physical overlap of our binning scheme (the same triplet 
can be counted more than once). We again see that the 
correlation in the projected measurements indicate more 
correlation in comparison to redshift space. Analogous 
to the 2PCF, we find that fainter galaxy samples ap- 
pear to have increased correlation between bins of the 
reduced 3PCF. Although we show only the largest mea- 
surements, we find significant off-diagonal elements at 
all scales. Clearly the use of a diagonal approximation 
to the covariance matrix would be a poor assumption. 

4.5. Effects of Super Structures 

Large coherent structures or ^^ super structures" , such 
as the Sloan Great Wall (SGW; Gott et al. 2005), can 
dramatically affect clustering measurements. Detailed 
analyses on SDSS g alaxy sample s have documented this 
in both the 2PCF (jZehavi et alJl 2002, 2005, 2010) and 
redshift space reduced 3PCF llNich ol et al.i 12006;) . One 
advantage of using jackknife re-sampling methods for er- 
ror analysis is that we probe the variation between differ- 
ent spatial regions essentially "for free" . We investigate 
this variation using our 30 jackknife samples. There are 
several reasons this variation of the 3PCF is worth inves- 
tigation beyond that in the literature. First, our samples 
are based on a newer SDSS sample (DR6) which includes 
additional regions of the sky resulting in a larger volume 
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Figure 9. The normalized covariance matrix for the 2PCF, both in redshift (top row) and projected space (bottom row) for our three 
DR6 galaxy samples of different magnitudes. From left to right, the panels indicate FAINT to BRIGHT galaxy samples. The matrix 
is normalized such that diagonal elements are set to unity, rather than the \a error. The solid, dashed and dotted contours correspond 
respectively to values of 0.70, 0.85 and 0.99. 



than previous studies. Second, as we have repeatedly 
mentioned, measurements of the reduced 3PCF are af- 
fected by a chosen binning scheme making it important 
to document the effects of structures given our exact pa- 
rameterization. Finally, we want to understand the im- 
pact on detailed measurements of the projected reduced 
3PCF for which the effect of super structures has not 
been previously investigated. 

We identify 6 out of 30 regions of the sky which show 
large deviations in the reduced 3PCF. These "jackknife 
regions" characterize a jackknife sample by being omit- 
ted from a clustering measurement. The northern SDSS 
DR6 footprint in shown in Figure fTTl with the entire sam- 
ple displayed in gray and the six regions highlighted by 
color. Two regions encapsulate the majority of the SGW, 
specifically the red and magenta regions at a J2000 dec- 
lination of zero. Overall, the jackknife regions appear 
contiguous and rectangular (excepting the apparent ge- 
ometry of the Aitoff sky projection). Please note the 
black region, however, which is split between two sides 
of the survey. The algorithm we use to define the jack- 
knife regions must occasionally make such divisions. 

Before examining the reduced 3PCF for these regions, 
let us briefiy review the jackknife re-sampling method. 
We excise a jackknife region from the full sample and 
measure the clustering. This means that a measurement 
on a specific jackknife sample represents the clustering 
of the entire sample omitting the jackknife region. If 



the clustering on the jackknife sample deviates strongly 
from the average of all samples, it means that a spe- 
cific jackknife region dominates the measurement for the 
entire sample. Without that region, the overall cluster- 
ing would be significantly different. This is a profound 
concept, as we do not expect such clustering differences 
given the volume of these SDSS samples. 

We investigate the effects on the reduced 3PCF in two 
of our three galaxy samples. Although the jackknife re- 
gions based on sky location are consistent across both, 
the redshift limits vary. This results in a volume that is 
not identical between samples, although it does overlap. 
We use our LSTAR (-21.5 < Mr < -20.5) and FAINT 
(—20.5 < Mr < —19.5) samples which both include the 
SGW at a mean redshift at z w 0.08. 

To highlight the clustering deviations between samples, 
we plot the residual of the reduced 3PCF, as defined by 
(fT3l) . We subtract the mean reduced 3PCF of the 30 
samples from each individual measurement, and normal- 
ize this difference by the jackknife variance. As noted in 
(fT4|) . the pref actor for the jackknife variance is (■^^^), 
and not the familiar jf, so error estimates do not de- 
crease with N. In terms of the residuals, this means no 
jackknife sample will deviate significantly beyond the er- 
ror. This scaling of the residuals emphasizes how each 
jackknife region significantly contributes to the la error 
estimate. 



15 



Covariance: redshift -30.5 < M, < -19.5 




Covariance: redshift -21.5 < M„ < -20.5 



Covariance: redshift M_ < —21.5 



/ 














^ 






'-'■\ 


/ 






1 


r ~ ~ ^ 


^ 


1 ^ ' 





0.4 0.6 O.B 

fl / IT 



0.4 0.6 

e / rr 




Covariance: projected —20.5 < M 



-19.5 



Covariance: projected -21.5 < M 



20.5 



Covariance: projected M < —21.5 




Figure 10. The normalized covariance matrix for tlie reduced 3PCF, both in redshift (top row) and projected space (bottom row) for 
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Figure [T^ shows the residual of the reduced 3PCF for 
the LSTAR galaxy sample (-21.5 < Mr < -20.5). The 
left three panels are results for redshift space, Qz, and the 
right correspond to projected space, Qproj- The scale in- 
creases downward in order with ri = 3, 6 and 9 /i~^Mpc. 
The gray lines represent results from the 30 — 6 = 24 "or- 
dinary" jackknife samples. The colored lines correspond 
to jackknife samples that omit the jackknife regions high- 
lighted in Figure [TTJ Most of the regions are close to the 
mean value (i.e. around zero), but several samples de- 
viate, which always happens in the negative direction. 
If a structure in the jackknife region, which is included 
in all other samples, boosts Q{0), then a negative resid- 
ual results. In the LSTAR sample, Qz appears to be 
boosted at all scales by galaxies in the blue jackknife re- 
gion. As scale increases, structure in the red region also 
appears to increase Qz- Both regions also have an effect 
on Qproj, with red clearly dominating at ri = 9 h~^Mpc 
and blue at the small scale ri = 3 /i~^Mpc measurement. 
In Qproj, we note the black region, which is a physical 
neighbor to the blue region, strongly affects the small 
scale reduced 3PCF. 

It appears the red region, which encloses a significant 
portion of the SGW, predominantly affects the reduced 
3PCF on large sca les; this has b een s een before in the 
reduced 3PCF by iNichol et al.l (|2006l ). The blue and 
neighboring black regions were first included in the DR5 
release. At smaller scales, these regions clearly dominate 



the reduced 3PCF measurement. We visually inspect the 
galaxy distribution within the several of the anomalous 
regions, and find that these jackknife regions (consisting 
of 1/30 of the entire SDSS area) enclose a single domi- 
nant super structure, or sometimes just part of one struc- 
ture straddled across several regions. For example, there 
is an extremely clustered region at the boundary of the 
blue and black regions with a median redshift around 
z « 0.11. This is clearly distinct from the SGW. We 
refer to such regions as "super" structures since, like the 
SGW, they are coherent overdensities that are not grav- 
itationally self-bound in contrast to a galaxy cluster. In- 
dividually, these regions shift the reduced 3PCF by less 
than 10% - but this analysis neglects cumulative effects 
(remember, the same super structure is split between the 
blue and black regions). 

We examine the FAINT galaxy sample (-20.5 < Mr < 
— 19.5) in Figure fT3l Recall from the reduced 3PCF mea- 
surement in Figure |6l that the uncertainties were quite 
large, much more than the LSTAR sample. The residuals 
at the large scales are completely swamped from the clus- 
tering in the red region. Excluding this region changes 
the reduced 3PCF by approximately 20%, which we at- 
tribute to galaxies clustered in the SGW. The black and 
blue regions seem inconsequential for this sample; this 
makes sense as this fainter sample has a maximum red- 
shift of z = 0.086, thereby trimming the structure at 
z « 0.11 which accounted for the significant deviation 
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Figure 11. We show six selected jaclcknifc regions on tiie sky (yel- 
low, blue, black, green, magenta, and red) in comparison with the 
full galaxy sample (grey points) for the SDSS DR6 Nor th footprint 
in J2000 equatorial coordinates. The Sloan Great Wall i fGott et al.l 
1200 5) is located at ~ in declination, and is included in two of the 
six selected jackknife regions (red and magenta). 



in the LSTAR sample. We also see that a new region, 
denoted in yellow, dominates at small scales. 

5. DISCUSSION 

Our clustering measurements on our galaxy samples 
produced the following main results: 

• Q{0) exhibits configuration dependence at all scales 
(3 — 27 h^^Mpc), which is not consistent with Q 
being simply a constant. 

• Larger scales (9 — 27 h~^Mpc) exhibit a stronger 
configuration dependence and display the V-shape 
expected from gravitational evolution. 

• All galaxy samples show significantly different clus- 
tering strength than the mass estimates. 

• Uncertainties for the reduced 3PCF increase for 
larger scales and with smaller sample volume. 

Generally, the shape of our SDSS galaxy measurements 
appear similar to predictions of gravitational collapse in 
the ACDM model realized by the Hubble Volume simula- 
tion. The difference in clustering strength between galax- 
ies and mass predictions can be accounted for as galaxies 
of these luminosities are expected to be biased tracers of 
the mass field. We constrain the non-linear galaxy-mass 
bias using a detailed error analysis in a companion paper 
(McBrideet al. 2010). 

We studied measurements of three galaxy samples 
characterized by different luminosities, attributing ob- 
served variations as a luminosity dependence in the re- 
duced 3PCF. In gXHwe found: 
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Figure 12. The residuals, as defined in JTSj, for 30 jackknife 
samples of the reduced 3PCF in redshift and projected space for 
the DR6 galaxies with —21.5 < Mr < —20.5. A jackknife sample is 
defined by taking the full sample and excluding a specific jackknife 
region. Colors of the six selected jackknife samples corresp ond to 
excluding galaxies in the region of matching color in Figure [TT] 



• Luminosity dependence affects all scales at about 
the same level, which primarily changes the ampli- 
tude of Q. 

• The discrepancy between galaxies and mass predic- 
tions appears larger for brighter galaxies. 

• The covariance matrix appears more correlated for 
fainter galaxy samples. 

The division of two volume-limited samples into sub- 
populations based on color led to these observations: 

• The g — r color split isolates populations with dif- 
ferent clustering properties, 

• Red sub-samples have a higher average Q value 
than blue. 

• The difference between red and blue sub-samples 
is greatest at small scales and includes changes to 
the configuration dependence. 
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Figure 13. The analogous figure to ll2l the residuals as per Ijl3p . 
for 30 jackknife samples of the reduced 3PCF in redshift and pro- 
jected space for the DR,6 galaxies with —20.5 < Mr < —19.5. A 
jackknife sample is defined by taking the full sample and exclud- 
ing a specific jackknife region. Colors of the six selected jackknife 
samples corres pond to excluding galaxies in the region of matching 
color in Figure [TT] 



• The effects of galaxy color on Q are more pro- 
nounced than luminosity at small scales, where 
galaxy color more dramatically affects the configu- 
ration dependence. 

A concern might arise that the defined samples are 
not complete for both color definitions. This could re- 
sult from possible e volutiona ry effects (discussed in the 
context of DEEP2 in lGerkeet al.ll2007[ ) or inaccuracies in 
the X-correction due to differences in galaxy morphology. 
We confirmed that there were minimal to no differences 
in the measured reduced 3PCF by creating samples de- 
fined to be more conservative in their sample description, 
where we found the largest variation to be less than 10% 
of the reported errors. 

An analysis of the 2PCF for SDSS galaxies 
(jZehavi et al.ll2005l I2010D conceptually explains the clus- 
tering difference of the brighter galaxy samples. Brighter 
galaxies remain "more biased" than fainter samples re- 
sulting in a larger discrepancy between galaxies and mass 



for brighter galaxy samples. We see a similar result in our 
clustering measurements and find that Q shows less dis- 
crepancy between samples since Q oc 1/6 whereas ^ oc 6^ 
("more biased" refers to a higher linear bias value of 
b). The stronger effect of color separation on smaller 
scales (3 — 9 fe~^Mp c) agrees with the conclusions of 
iBlanton et al.l ()2005b[ ) that galaxy color is strongly tied 
to local envi r onmen t. Recent analysis of the 2PCF by 
iZehavi et al.l (|2010( ) showed striking differences in clus- 
tering between luminosity and color, where luminosity 
predominately shifts the amplitude, whereas different 
color samples show a different slope of the 2PCF. We find 
that the configuration dependence of Qproj is stronger 
for red sub-samples at smaller scales (ri — 3 h^^Mpc). 
This scale is still beyond a reasonable size for individual 
dark matter halos. However, it might suggest that re- 
gions outside large halos have a surrounding anisotropic 
distribution of red galaxies, perhaps tracing the infall re- 
gions. This interpretation supports obser vations of the 
angular distribution of satellite galaxies bv lAzzaro et al.l 
(|2007). where they find an alignment of satellites along 
the major axis of their host. They find the strongest 
evidence of alignment when considering red satellites 
of red hosts, which would be best represented by our 
red sub-sample. On measurements of large scale tri- 
angles (ri — 9 h^^Mpc), we conclude that both red 
and blue populations exhibit similar configuration de- 
pendence given the large errors. However, we speculate 
that the blue galaxies occupy regions that show stronger 
shape dependence. 

A detailed study of th e reduced 3PCF in redshift space 
(jGaztanaga et al.ll2005n was conducted on a different red- 
shift survey: the two-degree fiel d galaxy redshift sur- 
vey f2dFGRS: IColless et al.ll200lD . Their parameteriza- 
tion of triplets is very close to ours, and the analysis is 
in qualitative agreement with our measurements of Qz- 
They also divided a volume-limited sample into "red" 
and "blue" sub-samples, noting that red galaxies typi- 
cally have a larger value of Q^ below 12 h~^Mpc. 

The reduced 3PC F of SDSS ga.laxies was also studied 
in redshift space bv lKavo et al.l ()2004[ ). Their measure- 
ments that are sensitive to shape (see their Figure 13) 
show a configuration dependence that grows with scale. 
Their Q^ for s = 2.5 h~^Mpc shows very little configu- 
ration dependence with Qz < 1 for all 9, which is com- 
parable to our measurements at s = 3.0 /i~^Mpc, and 
confirm the effects of redshift distortions on these scales. 
In contrast to their findings, our measurements suggest 
a luminosity dependence of the reduced 3PCF which is 
apparent in the equilateral Qeq (Figure [3]) but more so in 
the configuration dependent Q{0) (Figure [5]). However, 
even our study suggest a weak statistical significance, es- 
pecially on the larger scales. T he difference between our 
results and iKavo et al] (|2004[ ) is likely due to statisti- 
cal strength of the galaxy samples - since DR6 contains 
many more galaxies in a much larger volume than their 
earlier data, it makes the weak luminosity dependence 
noticeable. We note a discrepan cy between t he res ults 
with respect to color dependence. IKavo et all ()2004[ ) see 
very little clustering difference between populations on 
our small scales (middle panels of their Figure 13). In 
addition, their measurements show the blue population 
has larger value of Q than the red on larger scales (bot- 
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torn panels of their same figure) , whereas our results show 
little difference (our right hand panels of figures [7] and |6]). 
However, the latter effect is within la given the uncer- 
tainties of both studies, making any conclusions sugges- 
tive at best. The differences might be accounted for by 
one of the many subtle differences of the analyses, which 
include (1) different SDSS dataset, (2) different galaxy 
sample and color definitions, and (3) different choices of 
triangle parameterization and binning. Sorting out these 
systematic differences would require a detailed joint com- 
parison which is not warranted by the low statistical sig- 
nificance of the discrepancy. 

We compare clustering of the reduced 3PCF in redshift 
and projected space to find: 



• \a^proj 



successfully recovers configuration depen- 
dence at small scales (3 — 9 ft,~^Mpc), which is lost 
in Qz- 






has a higher amplitude at small scales than 



• Qz and Qproj both converge at large scales (9 — 
27/i-iMpc). 



Both Q^ and Qproj were measured by iJing and Borneii 
(|2004l ). Although they analyze entirely different galaxy 
data (2dFGRS), we expect similar behavior between red- 
shift and projected space measurements. Their results 
do not agree completely with our findings on compara- 
ble scales and triplet configuration. While they notice 
an increase in the amplitude of Q between redshift and 
projected measurements in their "full" sample (compare 
their figures 7 and 12 for r — 3.25 /i~^Mpc with u — 2), 
it is not as much of a difference as our results suggest. 
They also see no configuration dependence in Qproj ■ We 
suspect their measurements obscure these features due 
to their choice o f bi nning an d triangle param e teriza tion, 
as described in [Gaztahaga and Scoccimarrol (I2005D. A 
separa te analysis of 2dFGRS data by 'Gaztanag a et ahl 
()2005f ). which uses a parameterization similar to what 
we employ, presents Qz meas urements that show mo re 
configuration dependence than iJing and Borneii ()2004[ ) . 
We have seen that the covariance of the reduced 3PCF 
yields significant structure at all resolved scales and in all 
of our galaxy samples. While our choice of wide bins con- 
tribute to the correlation, especially when 9 approaches 
zero or tt, we conclude most of the correlation is physi- 
cal. Measuring the configuration dependence of the re- 
duced 3PCF requires closely packed measurements. The 
range of scales probed by a specific Q(ri, r2, 9) measure- 
ment is the change in scale of the third side, equal to 2ri 
for our measurements. Empirically estimating the co- 
variance enables both systematic correlations (i.e. over- 
lapping bins) and physical covariance between reduced 
3PCF values to be taken into account for quantitative 
constraints. There is a limit to the effectiveness of empir- 
ical determinations: if we bin the correlation function too 
finely, such that pair and triplet counts become poorly 
sampled, the uncertainties will be dominated by Poisson 
noise and the covariance matrix will look diagonal. This 
would be a false representation of the correlation, and 
care must be taken to properly account for noise in the 
covariance matrix if used in an analysis. We note that we 
approached this undersampled limit for the covariance of 



our brightest sample {Mr < —21.5) even with our wide 
binning scheme (bin- width was 0.25r, where r is the scale 
probed). 

If we restrict ourselves to the quasi-linear regime where 
^ < 1, represented in the reduced 3PCF by the largest 
scale measured (ri = 9 h~^Mpc), we see significant 
structure to the covariance matrix. This type of struc- 
ture, alm ost an "X" pattern, has been seen in simula- 
tions (^e.g. lGaztanaga and Scoccimarroll2d05l ). We notice 
that the overall correlation increases as the galaxy sam- 
ple becomes fainter for this scale. This luminosity de- 
pende nce of covaria.nce h as been seen in the projected 
2PCF (jZehaviet al.ll2005[ ). This work is the first time it 
has been resolved in observational measurements of the 
reduced 3PCF, which we note in both redshift and pro- 
jected space. An important implication of this result is 
that care must be taken to properly estimate the covari- 
ance matrix specific to the galaxy sample being studied. 
The correlation matrix of one galaxy sample is not nec- 
essarily representative of another. 

We investigate the variation of clustering in jackknife 
samples to enable a view of clustering in SDSS galaxy 
samples with an alternative perspective. A few regions, 
which each likely contain one or perhaps two "super 
structures", dictate the clustering of the entire sample 
- even with the sizable volume represented by the SDSS 
samples. Different structures affect different scales. We 
clearly showed anomalous structures in a few rare re- 
gions dictate the accuracy of clustering measurements. 
The standard proposed solution is to continue collecting 
bigger samples in the hope of a large enough volume to 
average over multiple rare "super" structures. Clearly, 
these super structures can affect quantitative descrip- 
tions of bias. How best to handle this in detailed analyses 
remains an open question, but is most often addressed 
by comparing sub-samples with matching volumes (so a 
specific structure affects them equally). 

We agree with iNichol et al.l (|2006) that the reduced 
3PCF shows more sensitivity to the presence of super 
structures than lower ord er statistics such as the 2PCF 
(jZehavi et al.l 120051 [2010l ). Ahhough we do not include 
details in this paper, we also noticed marked differences 
in the 2PCF due to our jackknife regions. With the re- 
duced 3PCF, deviations caused by the super structures 
appear largely as an amplitude offset affecting triplets of 
all configurations equally. It would be interesting to see if 
the lack of configuration dependence in the deviations be- 
tween jackknife regions continues for smaller scales that 
approach the size of large halos. The configuration de- 
pendence at the scales we measure are predominately 
due to filamentary structures (3 — 27 /i~^Mpc correlates 
structure between 3 different halos), making it unlikely 
that one specific region would dramatically enhance or 
erase this signature. 

6. SUMMARY 

In this paper we present clustering measurements of 
three volume-limited samples of SDSS galaxies. We 
investigate the 2PCF in §4.11 and find clustering mea- 
surements consisten t with other analyses of SDSS data 
(jZehavi et al.l 120051 I2O1O0 . Specifically, we note the 
2PCF is reasonably well approximated by a power-law 
model and brighter galaxies result in stronger clustering 
at all measured scales (0.3 — 30 /i~^Mpc). 
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We consider the reduced 3PCF in t j4.2l and HA.'Sl and 
find significant configuration dependence on intermedi- 
ate to large scales (3 — 27 h~^Mpc), in general agree- 
ment with predictions from ACDM. These results are 
in contrast to the hierarchical ansatz where the reduced 
3PCF shows no dependence on triplet shape. Below 
6 /i^^Mpc, the redshift space reduced 3PCF shows re- 
duced power and weak configuration dependence in com- 
parison with projected measurements. These results in- 
dicate that redshift distortions, and not galaxy bias, can 
make the reduced 3PCF appear consistent with the hier- 
archical ansatz. We address the luminosity dependence 
of our samples in >j4.3.1l and color dependence in i i4.3.2l 
Compared to the lower order 2PCF, the reduced 3PCF 
exhibits a weaker dependence on luminosity with no sig- 
nificant dependence on scales above 9 h~^Mpc. On scales 
less than 9 h~^Mpc, the reduced 3PCF shows a more dra- 
matic dependence on galaxy color than on luminosity, 
which includes significant changes to the configuration 
dependence. 

We resolve the covariance matrices of our cluster- 
ing measurements in ij4.41 calculated by jackknife re- 
sampling using 30 samples. We find significant structure 
in the covariance with large off-diagonal elements depict- 
ing strong correlations. These results demonstrate that 
an assumption of a diagonal covariance matrix is a poor 
choice, and the correlations must be taken into account 
for any quantitative analysis. The covariance matrix can 
be improperly resolved, such as when measurement bins 
are too small, complicating the use of the full covariance 
matrix due to noisy modes. We show that the overall 
correlation generally increased with fainter galaxy sam- 
ples, suggesting a luminosity dependence to the structure 
of the covariance. Clearly care must be taken to prop- 
erly estimate the covariance matrix specific to the galaxy 
sample being used for quantitative constraints — a fact 
which is often overlooked in recent work on the reduced 
3PCF. 

In ij4.5[ we demonstrate how large coherent structures, 
referred to as ^^ super structures" , affect these clustering 
measurements. We use 30 independent regions on the 
sky, and show that 6 of the 30 produce anomalous devi- 
ations in clustering, with different structures dominant 
at different scales. These regions, each of which contain 
one or perhaps two super structures, dictate the cluster- 
ing of the entire sample - even with the sizable volume 
of the SDSS galaxy samples. Two of these regions are 
coincident with th e huge structure known as the Sloan 
Great WaU (SGW: [GotteTall 120051) . whic h has already 
been shown to strongly affect clustering (jZehavi et al.l 
[20051 [20l0l: iNichol et al.l [20061 ). We further show that 
a specific region dominates clustering measurements dif- 
ferently based on the galaxy sample and scale. No one 
structure dominates all scales, but almost all measure- 



ments are affected by at least one region. 
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Figure Al. We show the fractional difference when correcting for sky completeness in the 2PCF in redshift (left panel) and projected 
(right) space. The fractional difference is defined as (^ — i;w)/iw where ^^ denotes the weighted (corrected) quantity. We show two galaxy 
measurements using two weighting schemes, where we vary the weight in calculating ^m- The solid lines depict weighting by the real sky 
completeness, and the dotted lines represent the same weights, but randomly assigned to galaxies. As we expect, the randomly shuffled 
weights show little difference with the unweighted 2PCF. 



APPENDIX 

A. MEASUREMENT SYSTEMATICS 

A.l. Estimating the Correlation Functions 

A detailed comparison between 2PCF estimators bv lKerscher et all (|200ClD found that ^^s performs as well or better 
than other formulations. An investigation of three-point estimators shows C ss performs favor ably to alternatives, with 



stable estimates when using the least number of randoms (see appendix in lKavo et al.|[200^ . 

The validity of our n-point calculator is of paramount importance. We carefully scrutinized the results of our main 
analysis code by separately implementing naive calculators for spatial and projected counts for both the 2PCF and 
3PCF. In addition to these internal consistency checks, we verified the accuracy with external codes used by diff erent 
research groups. For the spatial 3PCF, we reproduced exact results to those calculated by no t (iGrav et al.ll2004f) . We 
also checked that our projected 2PCF produced identical measurements to the code used bv lZehavi et al.l ()2005[ ). 

A. 2. Effects of Sky Completeness 

Our measurements must take into account the angular sky completeness of the survey. We restrict our analysis 
to volume-limited galaxy samples, and as such, the radial selection function typically used in flux-limited samples 
plays no role (it is defined to be unity for at all redshifts within the galaxy samples). The sky completeness can 
vary due to factors such as missing plates (specific regions in the sky), poor quality spectra and fiber collisions (see 
^2.^ . The completeness is well characterized by sector s, and calculated by comparing the number of targeted galaxies 
with the corresponding number of spectra obtained (jBlanton et al.ll2005af ). We correct the estimated correlation 
functions by applying a multiplicative weight to pair counts such that each galaxy uses a weight assigned from the 
inverse completeness of the respective region. For the large scales, this weighting co rrects the clustering strength fo r 
regions where you know galaxies exist, and is standard practice for measurements (jZehavi et al.l 12002 7 120051 . I201(D. 
To test the magnitude of this c orrection, we consider a sample of 73 320 galaxies from an earlier SDSS release (DR5; 
lAdelman-McCarthv et al"]|2007l . specifically referred to as saf e26), where we only consider galaxies with completeness 
between 0.5 and 1.0, with average and median values above 0.96. 

We first show the effect of weighting by sky completeness on the 2PCF in Figure lAll We see this sample shows 
an approximately 6% offset in both redshift and projected space at all scales below 30 h~^Mpc if the weights are 
unaccounted for (see solid lines in both panels). To demonstrate the validity of this test, we have re-measured the 
exact same galaxy data but randomly shuffled the weights which shows no fractional differe nce (dotted lines). 

We investigate the effects of neglecting sky completeness on the reduced 3PCF in Figure IA21 where we show both 
the absolute difference (left column) and fractional effect (right). We notice a systematic offset that increases with 
scale in both redshift and projected measurements. On larger scales (ri = 9 /i~^Mpc), we find about a 12% deviation 
in redshift space and almost 20% difference for projected space, showing more sensitivity to completeness weighting 
than the 2PCF. We estimate the uncertainties in these measurements from 30 jackknife samples at approximately 12% 
and 17% respectively, and we conclude that sky completeness must be considered for accurate 3PCF measurements. 
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Figure A2. The effects of correcting for sky completeness in the reduced 3PCF. The Qcw denotes the completeness weighted (corrected) 
quantity. The effect for both redshift space (red, x-marks) and projected space (blue, open circles) measurements are described by the 
absolute difference (left) and the fractional effect (right). Not accounting for sky completeness introduces a systematic offset that varies 
by scale, which can be seen in the left panel. The projected measurements appear more sensitive to sky completeness than redshift space 



A. 3. Effects of Binning 

The choice of binning scheme affects measurements and covariance matrices of the configuration dependence in the 
3PCF. To be specific, we refer to the choice of bin-size as well as their spacing by "binning scheme". For the 2PCF, 
the most common scheme is to use log spaced bins in r to account for the dynamic range of scales and power law 
dependence of the 2PCF. In this scheme, bins at larger r correspond to larger bin-widths. When we measure (3(6*), 
the angular bins are closely packed over a much smaller scale range making a choice of log spaced bins impractical. 
If the bin-size is too large, we notice an attenuation or averaging out of the configuration dependence. On the other 
hand, if the bin-size is too small, we do not resolve the measurement nor the covariance. Both the sample size and 
number density will impact the efficiency of a binning scheme. Even if we find an acceptable choice for one sample, 
it might show different effects on another sample. This effect remains more dramatic for the 3PCF due to the larger 
parameter space of potential configurations. 

Naively, for a given data set of size N , we expect the number of available triplets (i.e. N'^) to be much larger than 
pairs (iV^). However, the configuration dependence of the 3PCF is a function of three dependent variables. Any 
specific configuration represents a tiny slice through the available data, making the 3PCF much harder to determine. 
For a rough idea, we can compare the number of triplets to the number of pairs for a specific 3PCF configuration. 
For a choice of binning scheme with no overlap for the ri = 9 /i~^Mpc triangles, we find that approximately 1 in 
every 1000 pairs contribute to the 2PCF for the denominator of Qproji^)- For the 3PCF (the numerator), only 1 in 
67, 000, 000 contribute. Even if we consider that the 3PCF scales as the square of the 2PCF (i.e. ( oc ^^), we notice 
the triplet count in the 3PCF remains smaller by a factor of ~ 67. 

The impact of bin-size has only recently been addressed in the literature fsee'Gazt afiaga and Scoccimarrol l2005') . We 
believe this remained unresolved for two main reasons (1) the availability of large datasets to statistically determine 
finely binned higher order moments, and (2) the computational complexity of calculating them. For a small data sample, 
it may not be possible to measure the 3PCF with small enough bins. This is not the case with our SDSS samples. Most 
methods of estimation solve the computational complexity by performing counts after p re-gridding the data, thereby 
imposing a bin-size effect. While this can help mask the effect, it does not hide it entirely. iGaztafiaga and Scoccimarrol 
(|2005.) comment on the effect of bin-size using a pre-gridding technique by using a sufficiently fine grid. Our estimation 
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Figure A3. We show the reduced 3PCF on a subset of SDSS data. At small scales, we expect the redshift space distortions to cause 
a "U-shape" signal due to elongated fingers-of-god. This plot illustrates that small binning (±0.1 /i~^Mpc) in the two sides (ri and r2) 
resolves the "U-shapo" even with large 6 bins. For comparison, we show "Large Bins" defined to correspond to the binning scheme of 
INichol et al.l (2006 ) (i.e. ±0.5 h~^yipc). The error bars show the combined Poisson errors of the respective counts and signify the statistical 
significance of the bin-sizes. 



of correlation functions uses an efficient counting algorithm to yield exact counts, so we do not need to pre-grid our 
data. At large scales, measurements using a sufficiently fine pre-gridding technique and those with an exact count 
converge. At small scales, pre-gridding becomes less effective at reducing the computational expense and may become 
p rohibitively expensive. 

iGaztanaga and Scoccimarrol ()2005[ ) suggest a good test of bin-size is to measure the small scale configuration depen- 
dence of the reduced 3PCF in redshift space. Redshift distortions produce elongated finger-of-god structures, which 
should significantly amplify the signal of collapsed triangles at 6* = and 6 = tt. This characteristic "U-shape" will not 
be present if the bin-size is too large, since the finger-of-god structures are effectively averaged over. For the 3PCF, 
we must consider the bin-size in three parameters, i.e. each side of the triplet. In figure IX3l we show three choices of 
bin-size in 6 when two sides of the triangle (ri, r2) are tightly constrained (±0.1 /i~^Mpc). Even with large bins in 6, 
all measurements equivalently show configuration dependence, demonstrating the i mportance of bin-siz e for ri and r2 ■ 
We also show a measurement with "large bins" , constructed to match those used in INichol et al.l ()2006D where the first 
side of the triangle (ri) varies by ±0.5 /i~^Mpc. The larger bin-size completely masks the configuration dependence 
expected fr om t he redshift distortions. 

In Figure IA4I we extend the comparison of bin- width to the three specific scales that we measure in redshift and 
projected space. We utilize our fiducial binning scheme, which consists of 15 linear spaced bins in 9 with the bin-width 
chosen to be a fraction (denoted with /) of the scale of the bin midpoint. To be clear, the bin-size for ri and r2 also 
change appropriately with /. We show results for / — 0.1,0.2 and 0.3 with la Poisson uncertainties calculated from 
bin counts. The larger bin-width smooths the configuration dependence at all scales, with a dramatic effect on the 
ri — 9 h^^Mpc triangles. This occurs in both redshift and projected space measurements of Q{0). Physically, larger 
bins allow a greater range of configurations to be represented in each bin. 

We keep the number of bins fixed (at 15) but vary the bin- width; this results in an increased overlap of configurations, 
and hence imposes a larger correlation in the covariance between neighboring bins. Basically, the physical overlap is 
simply larger for equally spa ced 9 bins when 9 corresponds to '^ or '^ tt. We show the covariance matrices for 
Qz{9) and Qproj{6) in Figure lASl for the largest scale (ri — 9 /i^^Mpc) and two bin-widths using / = 0.1 and 0.25. 
The significant configuration overlap in / — 0.25 results in a larger correlation, as we expect. However, we also see 
increased correlation in non-overlapping bins (see the ~ with 9 ^ t: bins; the top left and bottom right corners). 

Clearly identified differences due to bin-size are prevalent in both measurements of the reduced 3PCF and its 
covariance. The general rule of thumb is that smaller bins, assuming they are well resolved, produce more accurate 
results - but that assumes we properly resolve the covariance. 
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Figure A4. In our measurements on DR6 data, we adopt a fiducial binning sclieme using linear spaced bins in d with a bin-size a set 
fraction of the midpoint. For the three triangle scales of interest, we compare this scheme using three fractional bin-widths on Q-^ (top 
row) and Qproj (bottom row): 10, 20 and 30%. The error bars represent Poisson errors on the counts from each bin. Larger bin-width 
measurements show smaller uncertainties and less configuration dependence. We use the LSTAR galaxy sample from DR6 selected to have 
-21.5 < Mr < -20.5. 



B. PROJECTED CORRELATION FUNCTIONS 

The purpose of employing projected correlation functions is to minimize the effects of redshift distortions on clustering 
measurements. However, we might ask how effective the projected statistic really is. We use the Hubble volume (HV) 
iV-body simulations to investigate this question. Since we want to apply our results to observational galaxy surveys, 
we construct realizations of dark matter (DM) that have the same footprint and volume of an observation galaxy 
sample. This should incorporate any edge effects or issues of finite volume that are also present in galaxy data. We 
use line-of-sight peculiar velocities to create a redshift space distribution of DM. In a few cases, we compare this to 
a similar distribution without any redshift distortions, which we call real space. Two important decisions need to be 
made when using projected measurements: (1) how large of Att bins do we use to integrate over and (2) what is the 
maximum line-of-sight distance for the integration (TTmax)- 

The first question stated above is relatively easy to ad dres s. We want to choose bins of Att that remain small enough 
to prevent a smoothing bias. In the left panel of Figure [BTI we show the anisotropy introduced by redshift distortions 
on the 2PCF, the Vp — tt diagram, within our HV test sample. Using these data, we calculate the projected 2PCF 
by integrating to 7r,nax — 40 /i~^Mpc with different values of Att. We compare each with the fiducial measurement 
using Att = 2 /i^^Mpc, and show the fractional difference in Figure [BTI We notice a very small effect, and the largest 
deviation appears at small projected separation where £,{rp,Tr) changes rapidly. The systematic effect remains below 
the 2% level even with bins as wide as Att = 20 ft,~^Mpc. 

The second question remains more subtle: what is an appropriate TTmax? Formally, we might prefer to keep TTmax = oo, 
which preserves the property that a power law spatial correlation function i^ produces a power law projected 2PCF 
Wp. Realistically, galaxy samples represent finite volumes and the correlation function can only be well estimated to 
a certain maximum distance. We want to keep our TTmax well below this limit, lest we dilute our signal. On the other 
hand, we want TTmax be large enough to (re-)capture clustering strength lost due to redshift distortions. We compute 
'Wp{rp) using two same volume realizations of DM, one in real space (no distortions) and one in redshift space, which 
we compare in Figure [B2] We expand TTmax to range between 20 and 80 h~^Mpc. We see several interesting features. 
First, both real and redshift space Wp{rp) are roughly power laws, with the strongest power corresponding to the 
largest TTmax- We expected this, as the projected 2PCF has units of distance, a larger integration equates to a higher 
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Figure A5. For our fiducial binning scheme, we show the covariance matrices for Qz{6) (top row) and Qproji^) (bottom row) for the 
largest triangles with ri = 9 /i~^Mpc. The solid, dashed and dotted contours correspond respectively to values of 0.70, 0.85 and 0.99. We 
use the LSTAR galaxy sample from DR6 (-21.5 < Mr < -20.5). 



functional value. We notice a much smaller difference between the smallest and largest TTmax in redshift space. This is 
a result of the large scale infall (Kaiser 1987), as clustering strength is compressed to smaller line-of-sight separation 
which we see in Figure [BT] Examining the fractional difference between real and redshift space measurements, we find 
a higher TTmax does what we expect in reducing the difference between real and redshift space Wp{rp) measurements. 
However, we also see that different values of TTmax dramatically changes the 2PCF from the "ideal case" , inducing a 
systematic difference which can range from right below the 10% level at r^ = 10;7ri„ax = 80 h~^Mpc to 80% at the 
largest scales and smallest TTmax- Finally, we conclude that even at TTmax ~ 80 h^^Mpc the effects of redshift distortions 
cannot be completely negated by the projected 2PCF. 

We extend the investigation to the reduced 3PCF, using the two largest triangle configurations (ri — 6 and 
9 /i~^Mpc). Since Q{6) is normalized by the respective 2PCF, we expect the same amplitude between spatial and 
projected measurements and can compare measurements on real observed galaxy samples. We show the measurements 
on an SPS S sample where we measure Q{9) in redshift space and projected with 7r,„ax = 20, 30 and 40 h^^Mpc in 
Figure lB3l For observational galaxies, it appears that Qproj recovers some configuration dependence with larger TTmax- 
The ri = 9 /i~^Mpc triangles show decreased amplitude perpendicular configurations and we note the "splitting" of 
collapsed triangles {6 ~ 0) in the smaller triangles. In both cases, a decreased TTmax yields a Q{0) that approaches the 
redshift space measurement, in agreement with our expectations. 

Overall, we conclude that the projected correlation function reduces the impact of redshift distortions on measure- 
ments of clustering. However, the projected statistic does not completely remove effects of the distortions for any 
of practical values of TTmax that we might use. Our results suggest a larger 7r,„ax rnight further minimize redshift 
distortions but a more thorough investigation is required to disentangle the systematics (for example, we integrate 
the projected 3PCF with a single bin of TTmax width which we would have to investigate). Since the computational 
efficiency of estimating the 3PCF decreases dramatically with increased 7r,„ax, we choose to use TTmax — 20 /i^^Mpc for 
our measurements. This choice should be sufficient to minimize some effects of redshift distortions, remaining most 
effective at small scales where it will recapture clustering strength lost from the non-linear collapse in dense regions 
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Figure Bl. On the left we show the rp — n diagram from the 2PCF on dark matter particles from the Hubble Volume simulation, 
using a grid of 2 h~^Mpc in both rp and tt. The HV particles are trimmed to match the same volume and footprint of SDSS data, with 
velocities used to construct radial redshift distortions. The red (solid) lines are contours of a specific value of the CF, 5(r"p,vr), with values 
of 3, 2, 1, 0.6, 0.4, 0.2, 0.1, 0.07, 0.04 with the yellow highlighted one corresponding to ^{rp, tt) = 1 . The blue (dot-dashed) semi-circles show 
a perfect isotropic correlation for comparison. On the right, we depict the accuracy of the Wp{rp) integration using different sized it bins, 
which we show as a function of projected scale (rp). We obtain the "truth" value by comparing to a fiducial Att = 2 h~^Mpc, and our 
•""max = 40 h~^y[pc. We show the 1% level of accuracy with the dashed line. 

(the fingers-of-god). 
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Figure B2. We show the projeeted 2PCF, uip(rp), for dark matter particles in the Hubble Volume simulation both in real space (top left 
panel), redshift space (top right panel), the absolute difference due to redshift distortions (bottom left panel), and the fractional difference 
(bottom right panel). The different curves correspond to different TTmax integrations, as denoted. We estimate Wp{rp) by integrating in bins 
of Att = 5 ?i~^Mpc. The legend in the top two panels denotes the maximum TTmax used for each calculation, such that "pi20" corresponds 
to TTmax = 20 h~ Mpc, etc. 
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Figure B3. We show the reduced 3PCF, Q{8), on a galaxy sample from SDSS DR5 at our two largest triangle scales: ri = 6 & 9 /i ^Mpc. 
We compare three different values of TTmax, namely 20,30 and 40 h~^Mpc with the spatial reduced 3PCF in redshift space. We note the 
lower values of TTmax tend toward the redshift space measurement, but emphasize that it is a minor effect at these scales. 
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